PyDigger - unearthing stuff about Python


NameVersionSummarydate
upspawn-ocr-cli 0.1.0b3 Modern, polished CLI to extract text from PDFs using the Mistral OCR API. 2025-08-15 23:24:29
hashub-docapp 1.0.0 Professional Python SDK for the HashubDocApp API - Advanced OCR, document conversion, and text extraction service 2025-08-15 12:09:58
kokoro-tts 2.2.1 A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents. 2025-08-14 22:13:00
streamlit-pdf 1.0.6 A Streamlit component for viewing PDF files 2025-08-14 20:48:20
llama-index-packs-resume-screener 0.9.1 llama-index packs resume_screener integration 2025-08-14 20:17:36
bulkinvoicer 0.1.0.dev1 A simple python script to quickly create bulk invoices. 2025-08-14 18:54:29
plutoprint 0.3.0 Paged HTML rendering library 2025-08-14 12:52:45
web2llm 0.5.1 A tool to scrape web content into clean Markdown for LLMs. 2025-08-14 08:53:14
aspose-cells 25.8.0 Aspose.Cells for Python via Java is a high-performance library that unleashes the full potential of Excel in your Python projects. It can be used to efficiently manipulate and convert Excel and spreadsheet formats including XLS, XLSX, XLSB, ODS, CSV, and HTML - all from your Python code. Amazingly, it also offers free support. 2025-08-14 02:30:43
pdfix-sdk 8.7.3 PDFix SDK - Automated PDF Remediation, Data Extraction, HTML Conversion 2025-08-14 00:23:04
llm-markdownify 0.3.0 Convert PDFs, images to high-quality Markdown using Vision LLMs. 2025-08-13 22:07:30
inkognito 0.1.0 Privacy-first document processing FastMCP server with PII anonymization 2025-08-13 17:45:52
lizeur 0.1.3 Lizeur is a MCP server to be able to get content from PDFs. 2025-08-13 17:21:00
aspose-words-cloud 25.8.0 Python Cloud SDK wraps Aspose.Words Cloud API so you could seamlessly integrate Microsoft Word file generation, manipulation, conversion & inspection features into your own python applications. 2025-08-13 12:45:20
surya-ocr 0.15.4 OCR, layout, reading order, and table recognition in 90+ languages 2025-08-12 23:21:48
docling 2.44.0 SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. 2025-08-12 09:52:48
pdfkb-mcp 0.4.1 A Model Context Protocol server for managing PDF documents with vector search capabilities 2025-08-12 04:10:04
diffpy.cmi 0.0.1 Complex modeling infrastructure: a modular framework for multi-modal modeling of scientific data. 2025-08-11 15:54:09
ipxact2systemverilog 1.0.26 Generate VHDL, SystemVerilog, html, rst, md, pdf, c headers from an IPXACT description 2025-08-11 11:20:59
docstrange 1.1.3 Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, JSON, CSV, HTML) with intelligent content extraction and advanced OCR. 2025-08-11 07:10:23
hourdayweektotal
39199910375311857
Elapsed time: 2.87676s